Construction of language models for an handwritten mail reading system

نویسندگان

  • Olivier Morillot
  • Laurence Likforman-Sulem
  • Emmanuèle Grosicki
چکیده

This paper presents a system for the recognition of unconstrained handwritten mails. The main part of this system is an HMM recognizer which uses trigraphs to model contextual information. This recognition system does not require any segmentation into words or characters and directly works at line level. To take into account linguistic information and enhance performance, a language model is introduced. This language model is based on bigrams and built from training document transcriptions only. Different experiments with various vocabulary sizes and language models have been conducted. Word Error Rate and Perplexity values are compared to show the interest of specific language models, fit to handwritten mail recognition task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

تشخیص دست‌نوشتۀ‌ برخط فارسی با استفاده از مدل زبانی و کاهش قوانین نگارش کاربر

The Joint-up, cursive form of Persian words and immense variety of its scripts, also different figures of Persian letters depending on their sitting positions in the words, have turned the Persian handwritings recognition to an intense challenge. The major obstacle of the most often recognition ways, is their inattention to sentence contexture which causes utilizing of a word with correct appea...

متن کامل

Investigating the Effect of Scaffolded Extensive Reading as an Anxiety Reducing Strategy in an Iranian EFL Context

Foreign Language Reading Anxiety (FLRA), distinguished as a distinct phenomenon from general language anxiety, has been shown to have a negative impact on reading comprehension skill especially for less proficient EFL learners. FLRA is believed to originate from "unfamiliar writing system" or learners' difficulty in pronouncing words and sentences (Saito, Graza, & Horwitz, 1999). Slow or word b...

متن کامل

Investigating the Effect of Scaffolded Extensive Reading as an Anxiety Reducing Strategy in an Iranian EFL Context

Foreign Language Reading Anxiety (FLRA), distinguished as a distinct phenomenon from general language anxiety, has been shown to have a negative impact on reading comprehension skill especially for less proficient EFL learners. FLRA is believed to originate from "unfamiliar writing system" or learners' difficulty in pronouncing words and sentences (Saito, Graza, & Horwitz, 1999). Slow or word b...

متن کامل

Constructing and Validating a Q-Matrix for Cognitive Diagnostic Analysis of a Reading Comprehension Test Battery

Of paramount importance in the study of cognitive diagnostic assessment (CDA) is the absence of tests developed for small-scale diagnostic purposes. Currently, much of the research carried out has been mainly on large-scale tests, e.g., TOEFL, MELAB, IELTS, etc. Even so, formative language assessment with a focus on informing instruction and engaging in identification of student’s strengths and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012